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Abstract 

This paper describes an algorithm for selecting a consistent set within the 
consistent histories approach to quantum mechanics and investigates its prop- 
erties. The algorithm uses a maximum information principle to select from 
among the consistent sets formed by projections defined by the Schmidt de- 
composition. The algorithm unconditionally predicts the possible events in 
closed quantum systems and ascribes probabilities to these events. A simple 
spin model is described and a complete classification of all exactly consistent 
sets of histories formed from Schmidt projections in the model is proved. This 
result is used to show that for this example the algorithm selects a physically 
realistic set. Other tentative suggestions in the literature for set selection 
algorithms using ideas from information theory are discussed. 
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I. INTRODUCTION 



It is hard to find an entirely satisfactory interpretation of the quantum theory of closed 
systems, since quantum theory does not distinguish physically interesting time-ordered se- 
quences of operators. In this paper, we consider one particular line of attack on this problem: 
the attempt to select consistent sets by using the Schmidt decomposition together with cri- 
teria intrinsic to the consistent histories formalism. For a discussion of why we believe 
consistent histories to be incomplete without a set selection algorithm see and for other 
ideas for set selection algorithms see This issue is controversial: others believe that 

the consistent histories approach is complete in itself [0-^. 

A. Consistent histories formalism 

We use a version of the consistent histories formalism in which the initial conditions are 
defined by a pure state, the histories are branch-dependent and consistency is defined by 
Gell-Mann and Hartle's medium consistency criterion eq. (|1.3| ). We restrict ourselves to 
closed quantum systems with a Hilbert space in which we fix a split 7i = Hi ®7i2, we write 
d\m{7ij) = dj and we suppose that di < d 2 < oo. The model described in sec. [IV] has a 
natural choice for the split. Other possibilities are discussed in [[|. 

Let be the initial state of a quantum system. A branch- dependent set of histories is a 
set of products of projection operators indexed by the variables a = {a n , a n _i, . . . , «i} and 
corresponding time coordinates {t n , . . . , ti}, where the ranges of the and the projections 
they define depend on the values of aik-i, . . . , a±, and the histories take the form: 

C a = PZ n (t n ; «„_!, . . . , ajP^t^; « n _ 2 , ...,«!)... P^ih) . (1.1) 

Here, for fixed values of c^-i, . . . , a%, the P^ k c^-i, . . . , «i) define a projective decompo- 
sition of the identity indexed by so that J2a k PaS^k] a k-i, ■ ■ ■ , «i) = 1 and 

P a k ( tk 'i • • • ' a l)Pa' k ( tk ' afc -!' ■ ■ ■ i a l) = $a k a' k Pa k (tk, • • • , «l) . (1.2) 
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Here and later, though we use the compact notation a to refer to a history, we intend the 
individual projection operators and their associated times to define the history. 
We use the consistency criterion[| 

D aP = 0, Va ^ /3, (1.3) 
which Gell-Mann and Hartle call medium consistency, where D a @ is the decoherence matrix 

D a p = Tr(C aP Cl). (1.4) 

Probabilities for consistent histories are defined by the formula 

p(a) = D aa . (1.5) 

With respect to the 7i = TCi (8> Ti.2 splitting of the Hilbert space, the Schmidt decompo- 
sition of \ip(t)) is an expression of the form 

l#)) = E[ft(i)] 1/2 kW)i®kW) 2 , (i.6) 

i=l 

where the Schmidt states {\wi)i} and ijifj^} form, respectively, an orthonormal basis of TC\ 
and part of an orthonormal basis of Ti.2, the functions Pi(t) are real and positive, and we 
take the positive square root. For fixed time t, any decomposition of the form eq. then 
has the same list of probability weights {pi(i}}, and the decomposition (|1.6Q is unique if 
these weights are all different. These probability weights are the eigenvalues of the reduced 
density matrix. 

The idea motivating this paper is that the combination of the ideas of the consistent his- 
tories formalism and the Schmidt decomposition might allow us to define a mathematically 
precise and physically interesting description of the quantum theory of a closed system. We 
consider constructing histories from the projection operators^ 



For a discussion of other consistency criteria see, for example, refs. [10-13]. 

2 There are other ways of constructing projections from the Schmidt decomposition ||, though for 
the model considered in this paper the choices are equivalent. 



P i (t) = \w i (t)) 1 (w i (t)\ 1 ®I 2 and P = h®h-EiPi(t), (1-7) 

which we refer to as Schmidt projections. If dim7^i = dim7^2 the complementary projection 
P is zero. In developing the ideas of this paper, we were influenced in particular by Albrecht's 
investigations [0,|T^] of the behaviour of the Schmidt decomposition in random Hamiltonian 
interaction models and the description of these models by consistent histories. 



II. INFORMATION 



Recent work [|3[||[T6],[T7| has shown some of the difficulties in formulating a successful 



set selection algorithm. The analysis of |J suggests that in many systems no algorithm 
that constructs sets by proceeding forwards in time will produce the correct physical set. 
If so, an algorithm must consider the entire time evolution of a system if it is always to 
overcome this problem. This paper introduces an algorithm that is global with respect to 
time: the algorithm considers the classQof all consistent sets of histories formed from Schmidt 
projections and selects from among them the one with the greatest Shannon information WE 



Information^ is a term often used in the study of quantum mechanics and is used in 
many different senses. Hartle flT]] f] considers the missing information of a set of histories 



in a generalised spacetime quantum mechanics — he defines the missing information S of a 
set of histories S with initial density matrix p as 

S(S,p)= max E(p'), (2.1) 

p'e{D(S,p')=D(S,p)} 

where D(S, p) is the decoherence matrix for the set of histories S with initial density matrix 
p. Throughout this paper E will denote the Shannon information of a set of probabilities 



3 Class is used as a synonym for set when referring to a set of sets of consistent histories. 
^Entropy or information- entropy are used instead by some authors. 
5 For comments on and corrections to Hartle's paper see [19| 
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or, in the case of a positive definite Hermitian matrix, the Shannon information of its 
eigenvalues^. So, for example, E(p') = — Trp'logp' and 

E(S,p) = J2-D aa \ogD aa , (2.2) 

aeS 

where {D aa } are the diagonal elements of the decoherence matrix D(S,p). Note that if a 
set of histories S is medium consistent then E(S,p) = E[D(S, p)}: generically this is not 
true for weak consistency criteria. 

S(S,p) is the information content of a maximum- entropy [^UJ estimation of the initial 
density matrix given the set of histories and their probabilities — it quantifies what can be 
inferred about the initial density matrix using the set of histories and their probabilities. 
Hartle goes on to define 

S{g,p) = wmS{S,p), (2.3) 

where Q is some class of consistent sets of histories. Computing S(Q,p) for different classes 
enables one to understand different ways information about a quantum system can be ob- 
tained. For example Hartle suggests comparing whether the same information is available 
using homogeneous |21j histories instead of the more general inhomogeneous histories. When 



Q is the class of all consistent sets he calls S(Q,p) the complete information. 

Eq. ( p.3|) could be used as the basis for a set selection algorithm by specifying some 
class of sets of histories Q and selecting a set of histories that produces the minimum in 
eq. ( |2.3j ). This does not work for general classes, since if the class contains sets of histories 
which include projections onto the eigenspaces of p (in non-relativistic quantum mechanics) 
these projections completely specify p, so a rather uninteresting set of histories is selected. 
However, if the initial state is pure and a Schmidt class (a class of sets of histories formed 
from Schmidt projections) is used it will not generically contain a set of histories that 
includes a rank one projection onto the initial state, hence the set of histories selected by 



6 in information theory the singularity for zero probabilities is removed by defining OlogO = 0. 
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eq. Q2.3Q might not be trivial. For instance the set of histories consisting of projections 
P ® I and P (g) I, where P is the projection onto the non-zero system Schmidt eigenspaces, 
has missing information logrank(P ® I). It might be considered unnatural to assume a 
pure initial state and then make a maximum entropy calculation over density matrices of 
other ranks; however, this idea has a more serious flaw. The aim of set selection algorithms 
is to make statements concerning physical events, not merely to supply initial conditions. 
This algorithm only searches for a set of histories that best specifies the initial conditions 
and there is no reason to expect it to produce sets that do more than describe the initial 
conditions. 

Isham and Linden independently, recently proposed a different version of missing 
information, which they call information- entropy, that is simpler and does not use ideas of 
maximum entropy. 

S'(S, p) = - E A» log , (2-4) 

a&s dim (a) 

where 

dim a = ^ (2.5) 

is the normalised dimension of the history, and C a and / are considered as operators in the 
same ra-fold tensor product space p2| , ^3| of 7i. For example, if the history a is defined by 
consecutive projections {P&, k = 1, . . . ,n} then dim (a) = Tr(Pi®- • -®P n )/d n = rank(Pi) x 
•••x rank(P„)/<i n . Like Hartle's missing information, S' decreases under refinements and 
extensions of S. Isham and Linden show that 

min S'(S, p) > — Tr p log p — n log d (2.6) 

and for some examples that the bound is obtained, and they conjecture that the bound is 
attained in general. Isham and Linden also suggest that information-entropy might help in 
the development of a set selection criterion — they suggest that perhaps the minimisation 
should be carried out with respect to a system-environment split. Clearly some restriction on 
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the class of sets used is necessary since bound ( |2.6| ) contains no mention of the Hamiltonian 
or time evolution of the system — simply minimising information-entropy is unlikely to 
produce a good set selection algorithm, since the sets of histories that describes experimental 
situations are much more than a description of the initial conditions. 

Gell-Mann and Hartle discuss similar ideas in detail in ref. |5[]. They introduce a mea- 
sure, which they call total information or augmented entropy, X that combines algorithmic 
information (see for example ref. |24[]), entropy- information and coarse graining. This is an 
attempt to provide a quantitative measure of quasiclassicality. They show that minimising 
S does not provide a useful set selection algorithm — the results are trivial, histories are 
selected that consist of nothing but projections onto the initial state — but they suggest 
augmenting the minimisation with a stronger consistency criterion, 

(a\MlMfs\P) = Pa S a(3 Va ^ /3, M a G M a and Mp G Mp, (2.7) 

where A4 a and M.p are sets of operators. This is an interesting idea. So far however, Gell- 
Mann and Hartle have not proposed a definite algorithm for choosing the Ai a . Without a 
concrete scheme for choosing the sets A4 a the set selection problem of course becomes the 
problem of selecting Ai a . There seems a risk that Gell-Mann and Hartle's proposal also 
has the previously mentioned disadvantage of favouring set of histories that only provide a 
description of the initial state and say nothing about the dynamics, though perhaps with a 
suitable choice for A4p this problem would not arise. 

The approach we present here starts with a precisely defined class of quasiclassical sets 
of histories (formed from Schmidt projections) and picks the set of histories from this class 
with the maximum information. 

It might seem counterintuitive to use a maximum information principle, especially as 
other approaches in the literature to date have looked at minimising measures of information. 
However, these approaches have started with a much larger class of sets of histories. Picking 
the set with largest information from these classes would result in a non-quasiclassical set 
of histories with each history having the same probability. In this approach though, we are 
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using a highly restricted class — the class formed using Schmidt projections. This class 
of histories is so restricted that in some cases it may only consist of sets with projections 
at t — onto the initial Schmidt states. Picking the set with the largest information 
tends to pick the set with the largest number of histories. Other functions of the history 
probabilities could also be used, the essential requirement being that the functions tend 
to increase with the number of projections. We regard this proposal as a starting point 
for further investigations into set selection algorithms — especially since there are only 
pragmatic rather than fundamental reasons for choosing maximum information as a set 
selection axiom. 



III. ALGORITHM 

Let Q(7i, U, be the class of all sets of non-trivial[], exactly consistent, branch- 
dependentQ histories formed from Schmidt projection operators, where 7i = 7i\ ® 7^2 is 
a finite Hilbert space, U(t) a time evolution operator and the initial state. Note that 
in this section the set of histories includes the initial state. The algorithm selects the set 
S G Q with the greatest Shannon information. That is 

max E(S) = max V -p a logp a , (3.1) 

where p a is the probability of history a. The class Q could be chosen differently by using 
any of the consistency or non-triviality criteria from ref. ||. Another variant uses sets 
of histories formed by Schmidt projections onto the system eigenspaces of the individual 
path-projected-states (U(t)C a \ip)), not the total state, so that the choice of projections is 
branch-dependent as well as the choice of projection times. This is likely to be necessary in 

7 In this paper we call a history trivial if its probability is zero and non-trivial if its probability is 
non-zero. 

8 A branch-independent version of the algorithm can be formulated similarly 
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general to produce realistic sets. 

When the initial state is pure, in a Hilbert space of dimension d (= d\d-i) there can only 
be d non-trivial, exactly consistent histories within a setQ. In realistic examples approxi- 
mate consistency may have to be considered. To ensure the algorithm is well defined it is 
important that the number of possible histories is finite, which will only be true if we use 
a parameterised non-triviality criterion or we use a consistency criterion, such as the DHC, 



that can only be satisfied by a finite number of histories |13fl . This is a natural requirement 
for any set of histories in a finite Hilbert space since the exactly consistent sets are finite. 

To show that the maximum in eq. (|3.1| ) exists we define two sets of histories as infor- 
mation equivalent, Si ~ S2, if E(S\) = E(S2)', that is, sets of histories are information 
equivalent if they have the same information. Note that information equivalent sets generi- 
cally are not physically equivalent, but physically equivalent sets are information equivalent. 
Eq. (|3.1| ) selects an information equivalent class of sets of histories that all have the max- 



imum information. Sufficient conditions for eq. (|3.1 ) to be well defined are that Qj ~ is 
closed and that E(S) is bounded. Q itself is not closed, but the only limit sets of histories it 
does not include are those containing zero probability histories, and since zero probability 
histories contribute zero information these limit sets are equivalent to sets which are in Q, 
hence Qj ~ is closed. Moreover these limit sets are also physically equivalent to some of 
the sets that they are information equivalent to, since they only differ by zero probability 
histories — excluding the limit sets does not change anything physical. The information of 
any set of histories in Q is bounded, since the number of histories in any set of histories in Q 
is bounded and the information of a set of n probabilities is bounded by logn. Conditions 
sufficient to ensure uniqueness are much more complicated. It seems likely that a unique 
physically equivalent class will generically be selected, but in special cases it is clear that 
this is not the case. 



9 There can be 2d if weak consistency is used. 
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First we describe some useful properties of this algorithm and then we apply it to a 
simple model. 



A. Completeness 



The set of histories selected by the algorithm cannot be extended (except trivially) 
because any non-trivial extension increases the information content. To see this consider 
the set of histories S and an extension S'. The probabilities for the new histories can be 



which is strictly greater than E(S) whenever the extension results in at least one non-zero 
probability. 



A set of branch-dependent histories has a probability tree structure, where each history 
a refers to a terminal node of the tree and the unique path from that node to the root node. 
The nodes themselves are associated with projection operators and path projected states. 
Define S ak to be the set of all histories extending from the k th node of history a, normalised 
so that the total probability is one. This is a set of histories in its own right which will be 
consistent if the entire set of histories is consistent. Consider a simple example where the 
first projection produces two histories with probabilities p and q and the subtrees from these 
nodes are S p and S q . The information for the set of histories can then be written, 



This formula is easy to generalise. Each subtree must have maximum information subject to 
the constraint that the history vectors span a space orthogonal to the other history states. 
That is, a global maximum must also be a local maximum in each degree of freedom and 
the subtrees are the degrees of freedom. 




(3.2) 



B. Additivity 



E(S) = E({p,q})+pE(S p ) + qE(S q ). 



(3.3) 
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C. Large sets 

One of the problems with the algorithms in ref. || is their tendency to make projections 
too early so that they prevent projections at later times. Other problems also arise with al- 
gorithms that produce histories with zero or small probabilities. The maximum-information 
algorithm will not have these problems, since any projection that prevents later extensions 
is unlikely to be selected, histories with zero probability will never be selected (since they 
contribute no information), and histories with small probabilities are also unlikely to be 
selected. Therefore the algorithm is likely to produce large complicated sets of histories. 

D. Stability 

It is difficult to prove any general results about stability for this algorithm, but it seems 
likely to produce stable predictions for the following reason. The Schmidt projections and 
hence decoherence matrix elements generically will vary continuously with sufficiently small 
changes in the initial state and Hamiltonian, thus the algorithm can be regarded as a con- 
tinuous optimisation problem, and the solutions to continuous optimisation problems are 
stable. 

IV. A SIMPLE SPIN MODEL 

We now consider a simple model in which a single spin half particle, the system, moves 
past a line of spin half particles, the environment, and interacts with each in turn. This can 
be understood as modelling either a series of measurement interactions in the laboratory 
or a particle propagating through space and interacting with its environment. In the first 
case the environment spin half particles represent pointers for a series of measuring devices, 
and in the second they could represent, for example, incoming photons interacting with the 
particle. 
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Either way, the model omits features that would generally be important. For example, 
the interactions describe idealised sharp measurements — at best a good approximation to 
real measurement interactions, which are always imperfect. The environment is represented 
initially by the product of N particle states, which are initially unentangled either with 
the system or each other. The only interactions subsequently considered are between the 
system and the environment particles, and these interactions each take place in finite time. 
We assume too that the interactions are distinct: the k th is complete before the (k + l) th 
begins. 

A. Definition of the model 

We use a vector notation for the system states, so that if u is a unit vector in R 3 the 
eigenstates of a.u are represented by | ± u). With the pointer state analogy in mind, we 
use the basis {| |)fc> I i)k} to represent the k th environment particle state, together with 
the linear combinations |±) fe = (| ± i\ [) k )/\/2. We compactify the notation by writing 
environment states as single kets, so that for example | \) 1 <g> • • • <g> | \) n is written as 
| ti • • • T«), and we take the initial state \ip(0)) to be |v) <S> | Ti • • ■ Tn)- 

The interaction between the system and the k th environment particle is chosen so that it 
corresponds to a measurement of the system spin along the u k direction, so that the states 
evolve as follows: 

|u fc )® | T)*-> |u fc )<g>| T)k, (4.1) 
|-u fc ) <g> | T>fc -> |-Ufc) <8> | i) k - (4.2) 

A simple unitary operator that generates this evolution is 

U k (t) = P(u fc ) 4 + P(-u fc ) ® e - t6 ^ Fk , (4.3) 

where -P(x) = |x)(x| and F k = i\ [)k{] \k — i\ T)fc(l \k- Here 9k(t) is a function defined 
for each particle k, which varies from to tt/2 and represents how far the interaction has 
progressed. We define Pjt(±) = |±) fe (±] fe , so that F k = P k (+) - Pk{~)- 
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The Hamiltonian for this interaction is thus 

H k (t) = iU k (t)Ut(t) = k (t)P(-u k ) ® F k , (4.4) 

in both the Schrodinger and Heisenberg pictures. We write the extension of U k to the total 
Hilbert space as 

V k = P(u fc ) ® h ® ■ ■ ■ ® I n + P(-u fc ) <g> ii <g> • • • <g> ifc_i <g> e -^ ( ' )Ffc <g> 4+i (8) • • • <g> J n . (4.5) 

We take the system particle to interact initially with particle 1 and then with consecutively 
numbered ones, and there is no interaction between environment particles, so that the 
evolution operator for the complete system is 

U(t)=V n (t)...V 1 (t), (4.6) 

with each factor affecting only the Hilbert spaces of the system and one of the environment 
spins. 

We suppose, finally, that the interactions take place in disjoint time intervals and that 
the first interaction begins at t = 0, so that the total Hamiltonian is simply 

n 

H(t) = J2H k (t), (4.7) 
k=i 

and we have that 6i(t) > for t > and that, if 9 k (t) e (0, n/2), then Q^t) = tt/2 for all % < 
k and 9i(t) = for all % > k. 

V. CLASSIFICATION OF SCHMIDT PROJECTION CONSISTENT SETS IN 

THE MODEL 

For generic choices of the spin measurement directions, in which no adjacent pair of the 
vectors {v, Ui, . . . , u„} is parallel or orthogonal, the exactly consistent branch-dependent 
sets defined by the Schmidt projections onto the system space can be completely classified 
in this model. The following classification theorem is proved in this section: 
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Theorem In the spin model denned above, suppose that no adjacent pair of the vec- 
tors {v, iii, • • • ) u n} is parallel or orthogonal. Then the histories of the branch-dependent 
consistent sets defined by Schmidt projections take one of the following forms: 

(i) a series of Schmidt projections made at times between the interactions — i.e. at times 

t such that 9k(t) = or 7r/2 for all fc. 

(ii) a series as in (i), made at times ti, . . . ,t n , together with one Schmidt projection made 

at any time t during the interaction immediately preceding the last projection time t n . 

(iii) a series as in (i), together with one Schmidt projection made at any time t during an 
interaction taking place after t n . 

Conversely, any branch-dependent set, each of whose histories takes one of the forms (i)-(iii), 
is consistent. 

We assume below that the set of spin measurement directions satisfies the condition of the 
theorem: since this can be ensured by an arbitrarily small perturbation, this seems physically 
reasonable. The next sections explain, with the aid of this classification, the results of various 
set selection algorithms applied to the model. 

A. Calculating the Schmidt states 

Eq. (|4.3| ) can be written 

Uj(t) = e -^W p (- u ^ <g> Pj(+) + e ^m(-^) g, p 3 .(_). (5.1) 

Define x + j(t) = exp[— %9j{t)P{— Uj)] and X-j(t) = x' + j(t) so Uj(t) = x + j(t) <g> Pj(+) + 
X-j(t) <8>Pj(—)- Let 7r be a string of n pluses and minuses, \ir) denote the environment state 
|7ri)i <S> ■ ■ ■ <8i |7r n ) n , P(ir) = and x n (t) = x Wnn (t) . . .x ni i(t). Then 

[/(t) = ®P(tt). (5.2) 
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The time evolution of the initial state 1^(0)) = |v) (g> | |i • • • Tn), the corresponding reduced 
density matrix and the Schmidt decomposition can now be calculated, 

IV(*)> = ® p Wl v ) ® I ti ••• Tn} = 2^ n/2 E^( t )l v ) ® k>> ( 5 - 3 ) 

7T 7T 

since P(tt)| ti • • • Tn) — 2~ n / 2 |7r). The reduced density matrix is 

Pr (t) = Tr E [\m)(m\] = 2- n J2x n (t)P(v)4(t). (5.4) 

This can be further simplified by using the homomorphism between SU(2) and SO(3). 
Define the rotation operators 

B +k (t) = P(u fc ) + cos0 fc (t)P(u fc ) - sin0 fe (t)u k A, (5.5) 

B-k(t) = B+ k (t) and B n j k (t) = B nkk (t) . . . B v .j(t). B +k {t) corresponds to a rotation of angle 
k {t) about Ufc, and P{u k ) = UfcUjf , a projection operator on P 3 . Note that P(u*;) is also used 
to indicate a projection in the system Hilbert space — its meaning should be clear from the 
context. B w i n (t) will usually be simplified to B n (t). Then x 7ri i(t)P(v)x^ ll (t) = P[B nil (t)v}. 
Eq. (|5.4|) can then be written 

Pr {t) = 2- n Y,P[ B ^]- (5-6) 

7T 

Define Aj{t) = l/2[B +j (t) + £_,-(*)] = P(u J ) + cos%(t)P(uj) and A jk (t) = A k (t) . . . Aj{t), 
then 2~ n J2^B 7r (t) = A ln (t). A ln (t) will usually be written A(t). Since P[B % (t)v] is linear 
in B v (t) the sum in eq. ( |5.6| ) can then be done, so 

1 + a.A(t)v 

Pr(t) = 2-^-. (5.7) 

Generically this is not a projection operator since |A(i)v| may not equal 1. It is convenient 
however to define P(y) = 1/2(1 + er.y) for all y G C 3 , and this extended definition will 
be used throughout the paper. P(y) is a projection operator if and only if y is a real unit 
vector. Eq. ( p.7|) can now be written as p r (t) = P[A(t)v]. 

The eigenvalues of eq. ( |5.7| ) are 1/2 [1 ± AT(t)] and the corresponding eigenstates, for 
N(t) ^ 0, are | ± w(t)), where iV(t) = |A(t)v| and w(t) = A(t)vJV -1 (t). 
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Lemma 1. Sufficient conditions that N(t) 7^ for all t are that 9i(t) > 9j(t) for all i < j 
and Uj.u i+1 7^ for alii > 0. 

Proof. Suppose 3t s.t. JV(t) = 0, => det A(t) = 0, 3j s.t. detA,(t) = 0, 0j(t) = tt/2. 
Let j be the largest j s.t. ^(t) = vr/2, then = P(u;)Vz < j and detA(t) 7^ (M > j, 

iV(£) = ||A(j + i) n (£)u.j|| rij>i>o l u i- u i+i| an d detAy +1 ) n (t) 7^ 0, =^ 3? s.t. |u;.u i+1 l = 0# 

For the rest of this paper it will be assumed that and {uj} satisfy the conditions 
of lemma 1. The condition on the {#;} holds so long as the environment spin particles are 
further apart than the range of their individual interactions. The condition on {u«} holds 
generically and is physically reasonable since any realistic experiment will not have exact 
alignment. 

B. Decoherence matrix elements 

The Heisenberg picture Schmidt projection operators are 

P§(t) = U^(t)P[±w(t)\ <g> I E U(t). (5.8) 

Eq. ( |5.8|) can be rewritten using eq. (|5.2|) 

Pn(t) = E xUt)P[±w(t)]x„(t) ® P(tt) = £ P[±w w (t)] ® P(vr), (5.9) 

where w w (*) = £j(i)w(t). 

Consider the probability of a history consisting of projections at time t and then s, where 
the projectors are Schmidt projectors. 

p(±±) = \\PH(s)p^t)\m)\\ 2 - (5.10) 

Eq. ( |5.1Up simplifies using eq. ( |5T9"D and P(tt)\i/j(0)) = 2 -1 / 2 |v) (g> \ir) to become 

P (±±) = e iip[±w.( s )]p[±w.(t)]|v) ® p(vr)i ti . . . uir 

7T 

= 2-"~ 2 £[1 ± w^).v] [1 ± w w (t).w T (s)]. (5.11) 

The off-diagonal decoherence matrix elements can be calculated similarly. 
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(m\p^t)p^s)ps(t)\m) 

= 2-^Tr{P(v)P[±w 7r (t)]P[±w 7r ( S )]P[ T w 7r (t)]} 

7T 

= 2~ n ~ 2 £[w ff (t) A v].[±w ff (*) A w w (s) ± tw n (s)} . (5.12) 

For a general set of vectors {u^} and time functions {8k} eqs. ( |5.11| ) and ( |5.12[ ) are very 
complicated. However, with a restricted set of time functions a complete analysis is possible. 
The functions {8k} are said to describe a separated interaction if, for all t, there exists k s.t. 
8j(t) = 7r/2 for all j < k, and 8j(t) = for all j > k. For separated interactions a projection 
time t is said to be between interactions j and j + 1 when = 7r/2 for all i < j and 
8i(t) = for alH > j. A projection time t is said to be during interaction j when 9i(t) = ti/2 
for all i < j, 9i(t) — for all i > j and < < 7r/2. Separated interactions have a 
simple physical meaning: the interactions with the environment spins occur distinctly, and 
in sequence. 

Under this restriction a complete classification of all the consistent sets, both branch 
dependent and branch independent, is possible. This classification has a particularly simple 
form for generic v and {u^} satisfying u^.u^ + i 7^ 0, and Au^i 7^ for all k — 0, . . . ,n — 1. 
Recall u = v. For weak consistency the second requirement is stronger (u fc A u fc+1 ).(u fe+2 A 
u fc+i) = UfcP(u,fc + i)ufc + i 7^ 0. These assumptions will be assumed to hold unless stated 
otherwise. 

C. Classification theorem 

The proof first considers projections at two times and shows that a pair of times gives 
rise to non-trivial consistent histories only when the earlier time is between interactions or 
the earlier time is during an interaction and the later time between this interaction and 
the next. The second part of the proof shows that any set of branch- independent histories 
consisting of branches that satisfy this rule for all pairs of projections is consistent. The 
proof holds for weak and medium consistency criteria. 
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1. Allowed histories 



Let t be a time during interaction j. Define u = 9j(t) and <fi = 6j{s). Define 
x = Ai (i _ 1 )(s)v = Aiy_!)(*)v and y = Ag +1)n (s)Ai„(s)v. Note B vln (t) = B^t) and 
-SttIO-i) (t) = Ariy-i)(s) since t < s. With this notation and using simple vector identities 



the off-diagonal elements of the decoherence matrix (from eq. 5.12 ) are 



2-("'+ 2 )]T[w(t) A fl ff (t)v].[±w(t) A B„(t)w„(s) ±iB n (t)w^s)}. 



(5.13) 



Now 



B v (t)w v (s) = S wi (t)S wl0 -_ 1) (t)Sj 1(j -_ 1) (s)SS n (s)w( S ) = ^(t)S5 n ( S )w( S ), (5.14) 

which only depends on 7Tj for i > j. Since B wl j(t)v only depends on 7Tj for z < j the sum 
eq. (|5.14j) can be done over all 71$, i ^ j. 



2 1 ' j B„i J (t)v = [A j (t)-ir j fanwu j A]A 1( j- 1) (t)v 

TTi,i<j 

= w(t)N(t) — iTj sin a; u,, A x, 



(5.15) 
(5.16) 



J2 B^(t)B^ jn (s)w(s) = N- 1 (s)B 7rj (t)B^(s)Al +1)n (s)A ln (s)y 



N- 1 (s)B nJ (t)B^(s)y. 



(5.17) 
(5.18) 



Substitute these last two results into eq. ( |5.13|) which becomes 



2~ 3 iV^ 1 (s) £{w(i) A [w(t)N(t) - TTj sinwu,- A x]} 



.[±w(t) A B vj (t)BUs)y ± iB^(t)BUs)y\. 



(5.19) 



This can easily be simplified since w(t) A w(£) = 0. The only remaining term in the first 
bracket is then linear in ttj, so when the sum over ttj is taken only the terms linear in ttj in 
the second bracket remain. Eq. ( |5.19| ) is therefore 



l/4/V -1 (s) sincjsin(u; - <j>)[w(t) A (uj A x)].[w(i) A (uj Ay) ± iu, A y]. (5.20) 



Now w(t) = [P(uj) + coscjP(u i )]xiV- 1 (t) so w(i).(xAiij) = 0. Therefore 



[w(t) A (u, A x)].[w(t) A (u, A y)] = x J P(u,)y 



Also Uj.w(t) = Uj.-xN~ 1 (t) so 



[w(t) A (uj A x)].(u, A y) = -^(^(u^x)^^ A y) 



(5.21) 



(5.22) 



Eq. ( |5.19| ) can be simplified using eq. (|5.21[) and eq. ( |5.22| ) to 



l/4/V _1 (s) sinu sin(0 - cj){±x T P(uj)y ± ziV-^Xuj.xJu^x A y)} (5.23) 

The probabilities can be calculated during the same results. Summing all the terms i ^ j 
in eq. ( |5.11|) results in 

2 2^11 ± w(t).|w(t)iV(t) — 7Tj sinwUj A xjj < 1 ± — v 



N(s)N(t) 



2 _2 [l±iV(t)] <^ 1 ± 



x T [P(uj) + cosu;cos(0 — uj)P(uj)]y 



(5.24) 



N(s)N(t) J 
N 2 (s) = \Ai n (s)v\ = x T A,-(0)y and cos(u; — 0) cosu; — cos0 = sinu sin(0 — a;), so eq. ( |5.24[ ) 



is 



l/4[l±JV(f)] 



iV 2 (s) + sinu; sin(0 — c<j)x T P(uj)y 
N(s)N(t) 



(5.25) 



To write the decoherence matrix without using x and y it is necessary to consider three 
cases: when times s and t are during the same interaction, when they are during adjacent 
interactions and when they are during separated interactions. If t is during interaction j 
and s during interaction k the three cases are k = j,k = j + l and k > j + 1. For the 
remainder of this section let <fi = 0k(s), 



Nj(u) = \Aj(t)uj-i\ and A^- = |u fc .u fe+ i| 

j>k>i 



(5.26) 



Then 
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x - Ao(j-i)Uj_i 
N(t) = Xou-iMu) 
N{s) = \o(k-i)Nk((p) 

Aoy_i)^(s)Uj_i for fc = j 

AojA^^s)^- for fc = j + 1 

A(j+i)(/fc-i)Ao(fe-i)AT|(0)u i+ i for k > j + 1 

The probabilities of the histories (eq. |5.24j) are 



(5.27) 
(5.28) 
(5.29) 

(5.30) 



p(±±) = 1/4[1 ± A 00 -_ 1) iV J (^)][l ± a] 



(5.31) 



where 



TV 2 (0)+sinaj cos </<sin(</<— aj)|uj_i AUj | 2 
A (j _ 1); ;Af 2 +1 ((/))+coswsina;A 2 . +1) sin 2 ^uT^Pju^u^ 



N k (<i>) 



A( j _ 1 )( fc _ 1) +A( j+ i)( fc _ 1 ) cosa;sirmuJ_ 1 P(u J )u J+ i 



for k — j 
for = j + 1 
for k > j + 1 



The nonzero off-diagonal terms are (eq. |5.23| ) 



Aq(j_i) sinoj sin(</>— oj) cos 0|uj_i Auj | 2 

A ( J _ 1 )A J - (j+1 ) sinojcosajsin 2 < />[A r J (w)uJ_ 1 P(u J )u :l+ i±iA {j _ 1)j u : ,„i.(u j Au j+ i)] 

4JVj (w) 



for = j 
for = j + 1 
for k > j + 1. 



(5.32) 



(5.33) 



The off-diagonal terms can be zero for two reasons, either there is a degeneracy in the 
measurement spin directions, or s and t take special values. The necessary and sufficient con- 
ditions for the measurement spin directions not to be degenerate is that for all j Uj.Uj +1 ^ 
and Uj A Uj+i ^ 0. The first condition ensures that A^ 7^ for all i and j and that the 
Schmidt states are well defined. These cases do not need to be considered when we are 
interested in exact consistency because they have measure zero and almost surely under 
any perturbation the degeneracy will be lifted. If weak consistency is used only the real 
part needs to vanish and the measurement direction need to satisfy the stronger condition 
uJ_ 1 P(u ? )uj +1 7^ for all j. This is still of measure zero. If approximate consistency is being 
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considered the situation is more complicated as the histories will remain approximately con- 
sistent under small enough perturbations. This will not be considered in this letter. Unless 
said otherwise it will be assumed that the measurement spin direction are not degenerate. 

Therefore from eqs. ( |5.33| ) the only pairs of times giving rise to consistent projections 
are repeated projections (that is s = t which implies j = k and to = (/)), projections in 
between interactions and any later time (that is u = or 7r/2), and a projection during an 
interaction and a projection at the end of the same interaction (that is j = k u G [0, 7r/2] 
and = 7r/2.) 

2. Probabilities of allowed histories 

The model is invariant under strictly monotonic reparameterisations of time, t — > f(t). 
Therefore for separated interactions no generality is lost by choosing the time functions {6j} 
such that the j th interaction finishes at t — j, that is 6i(j) = tt/2 for all i < j and 6i(j) = 
for all i > j. It is convenient to define R^j = [P{ u i) — tTiUiA] . . . [P(iij) — 7TjUjA]. Then 
B n (m) = R n im- 

Consider the history a that consists of projections at times {m 8 : % = 1, 2, . . . I}, then at 
time it 6 (k — 1, k) and then at time k, where {rrij, k} is an ordered set of positive integers. 
This history means that the particle spin was in direction ±u mi at time i = 1, . . . , I, 
direction ±w(£) at time t and direction ±u^ at time k. Define uo = v and mo = 0. 

Using the same method as for two projections the probability for history a is 

i-i 

p a = 2- n 2- {l+2) lit 1 + a««,+iw> ! ).w 1 (m j+1 )] 

7T 1=0 

x [1 + aia t -w^(mi).w n (t)] x [1 + a t a k -w ^(t) .w ^(m k )} (5.34) 

Now 

w 7r (m i ).w 7r (m i+ i) = u^i? x i mi ^ lmi+i u mt+1 = u^.i? 7r ( mi+ i) m<+1 u m<+1 , (5.35) 
which only depends on Tij for m i+ i > j > rrii. Also 
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(5.36) 



which is independent of it and 



w 7r (t).w 7r (m i ) = N k 1 (t)ul_ 1 A k (t)B 7Tkk (t)R 7T{mi+1)(k _ 1) u, 



(5.37) 



which only depends on ttj for j > These last three equations show that each B nii is 
linear so the sum over 7r is trivial and each B nii can be replaced by Aj. 



2 mi mi+1 X J2 w ff (m i ).w ff (m i+ i) = u^P(u mi+1 ) ■ • • P(u mi+1 _ 1 )u mi+1 = A m .^ +1J (5.38) 



2 m ^ fc ^ w ^(t).w w (m / ) = N^uZ^Alftuk-xXrnM-!) = \ m (k-i)N k (t) (5.39) 



Pa = 2" ( ' +2) [l + a,a t A m , (fc _i)iVfc(t)][l + ^^^(^(u^.Ufe)] +a i a i +iA rajraw . 1 ]. (5.40) 



Since a coarse graining of a consistent set is consistent it is sufficient to only consider the 
off-diagonal decoherence matrix elements between the most finely grained allowed histories, 
which are those that consist of projections between all interactions and one projection during 
the interaction before the final projection. The off-diagonal elements of the decoherence 
matrix arise from only three forms, which depend on where the two branches separate, that 
is the earliest projector where they differ. 

First consider the case where two histories differ at a projection in between interactions 
and all projections up to that point have also been in between interactions. Let C a = 
QaPu{k) . . . Ph (1) and Cp = QpPu{k) . . . Ph(X)- The decoherence matrix element between 
them is 



7Tj, k > i > mi 



Using these results to do the sum over all n eq. ( |5.34j) is 



i-i 



i=0 



3. Consistency of allowed histories 



2~ n Y, Tr{Q n P(n k )x n (k)P[w n (k - 1)] . . . P[w w (l)]P(v) 



TT 




(5.41) 
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where Q n = (n\x n (k)Ql ( Q l 3xl( y k)\TT). Since Q a and Qp only contain projections after in- 
teraction k has completed Q n is independent of TCj for all j < k. Now P[w 7T (j)]P[w n (j — 

l)]P[w w 0')] = 1/2(1 + Ui-i^PKO")]. Let n = 2 1 --no<i< m (l + %-x.u,-) and eq. O 
is 

fi2~ n £ Tr{Q v P(u k )P[B^k)w v (k - l)]P(-u fc )} (5.42) 

But l/2E,/[5,(fc)w,(fc - 1)] = Pfu^Ufc-Ufc-O] and P(u k )P[u k (u k .u k ^)]P(-u h ) = so 
eq. (|5.42|) is zero. 

Now consider C a = P H (k)P H (t)P H (k - 1) . . . P H (1) and Cp = P H (k)P H (t)P H (k - 
1) . . . Pff(l). The decoherence matrix element between them is 

//2- n ^Tr{P[w 7r (A:)]P[w 7r (0]P[w 7r (A ; -l)]P[-w 7r (t)]P[w 7r (A ; )]}, (5.43) 

which, because B^Uk = equals 

^T n J2 Tr{P(u fc )P[w(t)]P[P 7rfcfc (t)u fc _ 1 ]P[-w(t)]P(w(A;)}. (5.44) 

7T 

The sum over iik can be done to give P[w(t))P[Ak(t)uk-i)P[—w(t)], and since w(t) is parallel 
to Ak(t)uk-i, eq. (|5.44j) is zero. 



The final case to consider is when then the histories a and (3 differ in their final projection. 
They will be trivially consistent. 

VI. THE ALGORITHM APPLIED TO THE SPIN MODEL 

A set of histories that maximises information must be complete, therefore all histories 
must consist of projections at times {1, . . . , k — 1, t, k : t G (A; — 1, A;)}. First we show that 
k must be the same for all histories, then we show that generically k — n. That is, the 
algorithm selects a branch independent set that generically describes a measurement at the 
end of each interaction plus one measurement during the final interaction. 

The information content of two subtrees rooted at the same point only depends on 
the projection times within each one. Either the two subtrees have the same information, 
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in which case their projection times must be the same, or one has more, but since the 
projection times used in the subtree with greater information will also be consistent if used 
in the subtree with less information these projection times can be used instead. Therefore 
in the set with maximum information all the subtree must have the same projection times, 
thus all the histories must have the same projection times — the maximal set is branch 
independent. 

Let the projection times be {1, . . . , k — 1, t, k : t £ (k — 1, k)}. Then from eq. (|5.40f) and 
eq. (|3.2|) the information content of this set is 

f[N k (e k (t))\ + /[(ufc.Ufc-OJVfc- 1 ^^))] + £ ,/iu ; ..n,-} (6.1) 

fc>j>0 

where 

f( x ) = ^— log — — log — ^— . (6.2) 



Maximising eq. (|6.1| ) with respect to t yields 

E(S k ) = E k = 2f(\u k .u k _ 1 \ 1 / 2 )+ ]T /(u, ,.u ; ). (6.3) 

k>j>0 

where S k is the branch independent set consisting of projections at times {1, . . . , k — 1, t k , k}. 
This is usually maximised by k = n but depending on the relationships between the ll, any 
value of k may be possible. For example, consider u 3 -_i.Uj = 1 — e for all j ^ k and 
Ufc_i.Ufc = e and e is small. 

O(eloge), for m < k, 

21og2 + 0(eloge), for m = k, (6.4) 
log 2 + O(eloge) for m > k, 

which for small e is maximised by E k . 

The precise relationship between the {iij} that ensure E n E k for all k < n is complicated 
in detail, but simple qualitatively. Roughly speaking, E n < E k only if |u,-_i.Uj| ^> |ujt_i.Ufe| 
for all j > k, that is all the measurement directions must be approximately parallel after 
the k th . Monte Carlo integration over {uj} (with the £0(3) invariant measure) shows that 
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for n = 3 set S n is selected 85.7% of the time, for n = 4 it is selected 84.3% of the time, and 
for all n > 4 it is selected 84.2% of the time. When the vectors are approximately parallel, 
that is |uj_i.Uj| = 1 — 0(e), set S n is selected with probability 1 — 0(e). If however all the 
measurement spins are approximately parallel (|uj_i.Uj| > 1 — e, and — neloge < 41og2) 
then for some orientations of the initial system spin (v = u ) E x > E k for all E k so set S 1 is 
selected. That is, the maximal set consists only of a projection during the first interaction 
and at the end of the first interaction. 

Though the results of the algorithm may seem counterintuitive the following discussion 
shows why this is not a problem. 

First consider the case when the system is genuinely closed. All the projections before 
the last interaction are natural, in the sense that they agree with our intuitive understanding 
of a measurement type process. It is only the projections during the last interaction, which 
occur when the set of histories is nearly complete, that are unnatural. Our intuition about 
the system and the result we believe to be correct relies on the experiment being embedded 
in a larger system in which the sets of histories considered are always far from complete. 

Second consider the case where the system is approximately closed. Then the sets S k 
should describe the first projections of a maximum-information solution in a larger Hilbert 
space. For reasons explained below, no non-trivial projections onto the system space will 
result in consistent extensions of the sets S k , even if the system interacts with new degrees 
of freedom in the environment. This shows that though it is a maximum-information set for 
a subsystem, it is unlikely to be part of the maximum-information set for the entire system. 
The set most likely to be part of the maximum-information set is the natural set, the set 
that consists of projections only at the end of each interaction. 

The set of normalised histories (in the Schrodinger picture at time k, that is the path- 
projected states) is 

S k = {|« v fc ) <g> |ai(T), • • • , afc-i(T), «fc(-0, Tfc+i, • • • , Tn)Va G Z% +1 }, (6.5) 

where a is a string of 2 fc+1 plusses and minuses, +(T) =t\ — (T) — I I) an d ±(— >) are 
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orthogonal vectors depending on Ufc_i and u^. This set of histories cannot be non-trivially 
extended with Schmidt projections (see sec. |V|). The reason for this is clear. Consider two 
of the histories |±Vfe) ® |e) where |e) is the environment state. These histories are only 
orthogonal because of the orthogonality of the system part of the states. There can be no 
future non-trivial extensions unless there is an exact degeneracy, because consistency terms 
between these two histories will contain terms like |(v|P(w)|v)| = ^1/2(1 + v.w), which is 
only zero when w = —v. In contrast if projections are only made at the end of interactions 
all the histories are orthogonal in the environment Hilbert space of the finished interactions. 
Unless these interactions are "undone" these histories will always remain orthogonal and 
cannot interfere. This argument suggests that the true maximum-information set for the 
total Hilbert space starts of with projections at the end of every interaction but at no interior 
times. 

This suggests that an algorithm designed to produce a maximum-information set for a 
subsystem could be constructed by requiring that all the histories in a set were orthogonal 
in the environment space, that is the reduced density matrices in the environment Hilbert 
space for each history are orthogonal. This is equivalent to considering sets of histories that 
satisfy the strong consistency criterion ( ^.7[ ) when the set {A4 a } is chosen to be {P <g> I : 
for all projectors P on Tii}. 

VII. OTHER ALGORITHMS 

Let Q (7i, U, be the class of all sets of non-trivial, exactly consistent, branch- 
dependent histories formed from Schmidt projection operators in the spin model. Consider 
an algorithm that selects the set in Q that minimises Isham and Linden's information- 
entropy ( |2.4| ). Due to the special symmetries of the spin model the selected set will be 
branch independent — the argument at the start of section ( |VTD is valid. 

Consider the set of projections at m times, so that the normalised dimension of each 
history is 2~ m . Information-entropy for this set is 
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S' = - £ Pa log = -2m log(2) -$>« logp Q . (7.1) 

Using the notation of the previous section this can be written 

S" = - £ [2 log 2 -/(a*)], (7.2) 

m>A;>0 

where the a& depend on the projection times and vary between —1 and 1. Since f(x) < log 2 
each term in the sum is always negative so the minimum occurs for m = n + 1, and the 
selected set consists of projections at the end of every interaction and a projection either 
at the end or the beginning of the last interaction — the algorithm has selected a natural 
set. The minimum-information-entropy algorithm selects a set with as many projections as 
possible, and among these sets it selects the set whose probabilities have the lowest Shannon 
information. One drawback with this approach is that unless trivial histories are excluded, 
or the number of histories in a set bounded, the minimum may not exist and the algorithm 
would therefore be ill defined. In particular if an infinite number of repeated projections are 
allowed the algorithm is ill defined. 

VIII. CONCLUSIONS 

This paper defines a precise algorithm for making probabilistic predictions for closed 
quantum systems. The algorithm considers the class of all non-trivial, exactly consistent, 
branch- dependent sets of histories defined by Schmidt projections with respect to a fixed 
split of the Hilbert space and selects from among them the set with the maximum Shannon 
information. The algorithm avoids many of the problems of the algorithms considered in 
ref. 0. Because it considers the entire time evolution of a system - roughly speaking it is 
global in time, whereas the algorithms in ref. are local — it does not make unphysical 
projections in systems where recoherence occurs and it produces complete sets of histories 
that describe the correlations between the system and the environment. Trivial and very 
small probability histories, which cause problems for some of the algorithms considered 
in ref. |§ by preventing later physical projections, are unlikely to be selected since they 
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contribute little information. The algorithm is also likely to be stable under perturbations 
in the initial conditions, the Hamiltonian and the parameters, since it involves maximising 
a continuous function. 

Section [VT| has shown that the algorithm selects a natural set for a simple spin model. It 
would be interesting to test out the algorithm on more realistic examples; however, it seems 
difficult to apply the algorithm directly, because of the large size and complicated nature of Q. 
Analytic calculations are only possible when the system is very simple and in more realistic 
examples computer simulations will be necessary. However, it should be possible at least 
to get some insight into the algorithm's predictions by maximising subject to constraints, 
that is by considering a more computationally tractable subset of Q. For example, we could 
choose a time interval T that is greater than the time of individual interactions (within the 
particular system) and larger than any timescale over which recoherence occurs. This would 
be used as a moving time-window over which to perform the maximisation. The earliest 
projection within each time-window would be selected and the next time-window would 
commence from that time. Such algorithms should select the same set as a global algorithm 
if T is large enough, and are also independently interesting. 

Because the algorithm predicts the probabilities for events and the set of possible events 
the algorithm is falsifiable: the algorithm is wrong if it selects any sets that do not agree 
with our experiences. The algorithm can also be applied to situations where we have no 
experience of what the natural sets of histories are: for example, a (finite) closed system of 
electrons and photons — and perhaps ultimately could be applied to theories of quantum 
cosmology. 
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